A dynamic programming approach to efficient sampling from Boltzmann distributions
نویسندگان
چکیده
Several algorithms in optimization, computer science and statistical mechanics are closely related to the problem of sampling from a Boltzmann distribution parameterized by the so-called temperature over a finite set. These include (i) Simulated Annealing (SA) [6], (ii) calculating the permanent of a nonnegative matrix [1], (iii) estimating the volume of a convex body [7], and (iv) computing partition functions of interacting particle systems such as the Ising model [8]. The finite set involved is typically exceedingly large rendering exact Boltzmann-sampling impractical. Hence one approach is to simulate “several” transitions of an ergodic Markov chain with the appropriate Boltzmann as its limiting distribution so that the final state-distribution wellapproximates this Boltzmann. More precisely, to sample from a Boltzmann distribution at temperature T∗ > 0, it is common to simulate a sequence of ergodic Markov chains whose limiting distributions are also Boltzmann at temperatures given by a “cooling schedule”—a strictly decreasing finite sequence of temperatures starting at a very high value, commonly ∞, and ending at T∗. Initial motivation for implementing the sampling procedure in phases defined by a cooling schedule came from annealing processes in physics where a glass or metal is toughened by cooling it slowly, starting at a high temperature, to a low temperature equilibrium. The mathematical intuition behind gradual cooling is to implement phases such that the distribution of the state at the end of one phase is not very different from the limiting distribution of the next phase with the hope that this will reduce the total number of iterations required to well-approximate the target Boltzmann. This concept is termed a “warm start” and has been featured in recent work in this area [5]. A search for an “optimal” cooling strategy must consider several key questions including how to select the number of Markov transitions in each phase. Specifically, given a fixed number of total iterations (this may typically arise from knowledge of available computational power, and computation time), running too many iterations in early phases may be wasteful, whereas too few may leave us “far away” from the limiting distributions of subsequent phases (“cold start”).
منابع مشابه
A Simplified Curved Boundary Condition in Stationary/Moving Boundaries for the Lattice Boltzmann Method
Lattice Boltzmann method is one of computational fluid dynamic subdivisions. Despite complicated mathematics involved in its background, end simple relations dominate on it; so in comparison to the conventional computational fluid dynamic methods, simpler computer programs are needed. Due to its characteristics for parallel programming, this method is considered efficient for the simulation of ...
متن کاملAnnealing Gaussian into ReLU: a New Sampling Strategy for Leaky-ReLU RBM
Restricted Boltzmann Machine (RBM) is a bipartite graphical model that is used as the building block in energy-based deep generative models. Due to numerical stability and quantifiability of the likelihood, RBM is commonly used with Bernoulli units. Here, we consider an alternative member of exponential family RBM with leaky rectified linear units – called leaky RBM. We first study the joint an...
متن کاملBankruptcy Prediction: Dynamic Geometric Genetic Programming (DGGP) Approach
In this paper, a new Dynamic Geometric Genetic Programming (DGGP) technique is applied to empirical analysis of financial ratios and bankruptcy prediction. Financial ratios are indeed desirable for prediction of corporate bankruptcy and identification of firms’ impending failure for investors, creditors, borrowing firms, and governments. By the time, several methods have been attempted in...
متن کاملAn Optimal Tax Relief Policy with Aligning Markov Chain and Dynamic Programming Approach
Abstract In this paper, Markov chain and dynamic programming were used to represent a suitable pattern for tax relief and tax evasion decrease based on tax earnings in Iran from 2005 to 2009. Results, by applying this model, showed that tax evasion were 6714 billion Rials**. With 4% relief to tax payers and by calculating present value of the received tax, it was reduced to 3108 billion Rials. ...
متن کاملExpected Duration of Dynamic Markov PERT Networks
Abstract : In this paper , we apply the stochastic dynamic programming to approximate the mean project completion time in dynamic Markov PERT networks. It is assumed that the activity durations are independent random variables with exponential distributions, but some social and economical problems influence the mean of activity durations. It is also assumed that the social problems evolve in ac...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Oper. Res. Lett.
دوره 36 شماره
صفحات -
تاریخ انتشار 2008